Organizing Gaussian mixture models into a tree for scaling up speaker retrieval

Authors

Jamal E. Rougui

Marc Gelgon

Driss Aboutajdine

Noureddine Mouaddib

Mohammed Rziza



Similar resources

Comparison of clustering methods: A case study of text-independent speaker modeling

Clustering is needed in various applications such as biometric person authentication, speech coding and recognition, image compression and information retrieval. Hundreds of clustering methods have been proposed for the task in various fields but, surprisingly, there are few extensive studies actually comparing them. An important question is how much the choice of a clustering method matters fo...


A Hybrid GMM/SVM System for Text Independent Speaker Identification

This paper proposes a novel approach that combines statistical models and support vector machines. A hybrid scheme which appropriately incorporates the advantages of both the generative and discriminant model paradigms is described and evaluated. Support vector machines (SVMs) are trained to divide the whole speakers’ space into small subsets of speakers within a hierarchical tree structure. Du...


Microsoft Word - djemili_rafik_paper.DOC


Recognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model

Speech is one of the most opulent and instant methods to express emotional characteristics of human beings, which conveys the cognitive and semantic concepts among humans. In this study, a statistical-based method for emotional recognition of speech signals is proposed, and a learning approach is introduced, which is based on the statistical model to classify internal feelings of the utterance....


Towards a more efficient SVM supervector speaker verification system using Gaussian reduction and a tree-structured hash

Speaker verification (SV) systems that employ maximum a posteriori (MAP) adaptation of a Gaussian mixture model (GMM) universal background model (UBM) incur a significant test-stage computational load in the calculation of a posteriori probabilities and sufficient statistics. We propose a multi-layered hash system employing a tree-structured GMM which uses Runnalls’ GMM reduction technique. The ...



Journal:

Pattern Recognition Letters

Volume 28

Publication date: 2007
